Conversation

@wForget
Member

@wForget wForget commented Apr 29, 2021

What changes were proposed in this pull request?

Remove the use of Guava in order to upgrade the Guava version to 27.

Why are the changes needed?

Hadoop 3.2.2 uses Guava 27; this change prepares for the Guava version upgrade.

Does this PR introduce any user-facing change?

no

How was this patch tested?

Modified the Guava version to 27.0-jre, and then compiled.

@AmplabJenkins

Can one of the admins verify this patch?

Member

@srowen srowen left a comment


Looks good, avoiding Guava in favor of the JDK classes.
Is that all the usage of com.google.common.base.Objects?
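For context, the kind of Guava-to-JDK replacement under discussion can be sketched as follows (a minimal illustration borrowing the AppShuffleId names mentioned in this thread, not the actual Spark code):

```java
import java.util.Objects;

// Minimal sketch (not the actual Spark class): java.util.Objects standing in
// for com.google.common.base.Objects in equals/hashCode implementations.
final class AppShuffleId {
    final String appId;
    final int shuffleId;

    AppShuffleId(String appId, int shuffleId) {
        this.appId = appId;
        this.shuffleId = shuffleId;
    }

    @Override
    public boolean equals(Object o) {
        if (this == o) return true;
        if (!(o instanceof AppShuffleId)) return false;
        AppShuffleId that = (AppShuffleId) o;
        // java.util.Objects.equals replaces Guava's Objects.equal
        return shuffleId == that.shuffleId && Objects.equals(appId, that.appId);
    }

    @Override
    public int hashCode() {
        // java.util.Objects.hash replaces Guava's Objects.hashCode
        return Objects.hash(appId, shuffleId);
    }
}
```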

Member


It isn't super important here I think, but does this result in the same string?

Member Author


The results are different. Guava's Objects produces something like "AppShuffleId{appId=appId, shuffleId=100}", while ToStringBuilder produces something like "RemoteBlockPushResolver.AppShuffleId[appId=appId,shuffleId=100]". Will that cause any problems?
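If the exact string matters, a hand-rolled toString on the plain JDK can reproduce Guava's toStringHelper format and sidestep the difference entirely (a sketch using the field names from this discussion, not the actual Spark code):

```java
// Minimal sketch (not the actual Spark class): a plain-JDK toString keeping
// Guava's "AppShuffleId{appId=..., shuffleId=...}" shape, rather than
// Commons Lang's "RemoteBlockPushResolver.AppShuffleId[appId=...,shuffleId=...]".
final class AppShuffleId {
    final String appId;
    final int shuffleId;

    AppShuffleId(String appId, int shuffleId) {
        this.appId = appId;
        this.shuffleId = shuffleId;
    }

    @Override
    public String toString() {
        return "AppShuffleId{appId=" + appId + ", shuffleId=" + shuffleId + "}";
    }
}
```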

Member


As with hash function changes, it shouldn't matter to programs. But if some program did rely on it, directly or accidentally, this might break. It's a tough call: how much is the change worth? Overall it's an OK improvement, but yeah, I'm hesitant for just this reason. It's more the hash change than this one.
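For the hash specifically, java.util.Objects.hash is specified to return Arrays.hashCode(values), and Guava's varargs Objects.hashCode is documented the same way, so this particular migration should produce identical values. A quick sanity check on the JDK side only (the Guava call itself is not exercised here, and the class and method names are illustrative):

```java
import java.util.Arrays;
import java.util.Objects;

public class HashCheck {
    // Returns true if Objects.hash matches the Arrays.hashCode contract that
    // both the JDK and Guava document for their varargs hash helpers.
    static boolean hashesAgree(String appId, int shuffleId) {
        int jdkHash = Objects.hash(appId, shuffleId);
        int arraysHash = Arrays.hashCode(new Object[] {appId, shuffleId});
        return jdkHash == arraysHash;
    }

    public static void main(String[] args) {
        System.out.println(hashesAgree("app-1", 100));  // prints "true"
    }
}
```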

@sunchao
Member

sunchao commented Apr 29, 2021

@wForget Spark master branch has already moved to the shaded Hadoop client by default (see SPARK-33212), which effectively isolates it from the Guava version on the Hadoop side. Did you actually see a Guava conflict issue?

@dongjoon-hyun
Member

+1 for @sunchao's comment. Also, Apache Spark 3.2 is moving toward Hadoop 3.3.x.

@srowen
Member

srowen commented Apr 29, 2021

It may be unnecessary for the reason above; it still probably wouldn't hurt to just move these to standard JDK classes. I do have a little bit of worry about changing behavior, though, with a possibly different hash or toString.

@sunchao
Member

sunchao commented Apr 29, 2021

@srowen yes agreed - it's better to avoid Guava usage in general if it's not necessary.

@dongjoon-hyun
Member

Ya, a different hash always bites us, both at Scala version changes and at Spark version changes.
For this case I'm not sure, but I'll leave this up to your decisions, @srowen and @sunchao.

BTW, it seems we need to revise the incorrect title and PR description about Hadoop 3.2.2. Could you make this PR neutral with respect to Hadoop, @wForget?

@wForget
Member Author

wForget commented Apr 30, 2021

@sunchao @dongjoon-hyun @srowen
Sorry, my description here was not accurate. The conflict was caused by our application introducing multiple versions of Guava; when I tried changing the Guava version to 27, I found a compilation problem.

@wForget wForget changed the title [SPARK-35270][SQL][CORE] Remove the use of guava to fix Hadoop 3.2.2 guava conflict. [SPARK-35270][SQL][CORE] Remove the use of guava in order to upgrade guava version to 27 Apr 30, 2021
@HyukjinKwon
Member

@wForget can you enable GitHub Actions in your forked repository? https://github.com/apache/spark/pull/32395/checks?check_run_id=2465058510

@wForget wForget force-pushed the master-gauva-compatible branch from 28df0ec to 46e1eee Compare May 6, 2021 02:05
@wForget
Member Author

wForget commented May 6, 2021

@HyukjinKwon I have enabled it. How do I rerun these checks?

@HyukjinKwon
Member

Did you do something like #32400 (comment) too? If that's done, feel free to rebase, which should retrigger the tests.

@wForget wForget force-pushed the master-gauva-compatible branch from 46e1eee to d37c843 Compare May 6, 2021 03:47
@pan3793
Member

pan3793 commented May 31, 2021

It seems spark-core already shades Guava. As for Hadoop 3.2, since Spark has already moved to the Hadoop shaded client, I only see Curator depending on Guava. Based on https://cwiki.apache.org/confluence/display/CURATOR/TN13, I think it's OK to bundle a higher version of Guava in the Spark hadoop-3.2 binary dist?

@srowen
Member

srowen commented Jun 1, 2021

I think the concern about changing behavior still stands?

@pan3793
Member

pan3793 commented Jul 26, 2021

Any update?

@github-actions

github-actions bot commented Nov 4, 2021

We're closing this PR because it hasn't been updated in a while. This isn't a judgement on the merit of the PR in any way. It's just a way of keeping the PR queue manageable.
If you'd like to revive this PR, please reopen it and ask a committer to remove the Stale tag!

@github-actions github-actions bot added the Stale label Nov 4, 2021
@github-actions github-actions bot closed this Nov 5, 2021